Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 408663 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 34.3 MiB |
| Average record size in memory | 88.0 B |
Variable types
| Categorical | 1 |
|---|---|
| DateTime | 1 |
| Numeric | 9 |
id_estacion has a high cardinality: 207 distinct values | High cardinality |
tmax is highly correlated with tmin | High correlation |
tmin is highly correlated with tmax | High correlation |
longitud is highly correlated with latitud | High correlation |
latitud is highly correlated with longitud | High correlation |
tmax is highly correlated with tmin | High correlation |
tmin is highly correlated with tmax | High correlation |
tmax is highly correlated with tmin | High correlation |
tmin is highly correlated with tmax | High correlation |
tmin is highly correlated with tmax and 2 other fields | High correlation |
longitud is highly correlated with altitud and 1 other fields | High correlation |
tmax is highly correlated with tmin and 2 other fields | High correlation |
altitud is highly correlated with tmin and 3 other fields | High correlation |
fecha_cnt is highly correlated with tmin and 1 other fields | High correlation |
latitud is highly correlated with longitud and 1 other fields | High correlation |
nevada is highly skewed (γ1 = 227.6590307) | Skewed |
prof_nieve is highly skewed (γ1 = 64.20443149) | Skewed |
precip has 163844 (40.1%) zeros | Zeros |
nevada has 408645 (> 99.9%) zeros | Zeros |
prof_nieve has 406496 (99.5%) zeros | Zeros |
Reproduction
| Analysis started | 2021-10-09 13:05:59.156280 |
|---|---|
| Analysis finished | 2021-10-09 13:06:16.795447 |
| Duration | 17.64 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 207 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
| SP000009981 | 6040 |
|---|---|
| SP000008280 | 5807 |
| SP000003195 | 5283 |
| SPE00120629 | 5263 |
| SP000060010 | 5246 |
| Other values (202) |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Characters and Unicode
| Total characters | 4495293 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SP000003195 |
|---|---|
| 2nd row | SP000003195 |
| 3rd row | SP000003195 |
| 4th row | SP000003195 |
| 5th row | SP000003195 |
Common Values
| Value | Count | Frequency (%) |
| SP000009981 | 6040 | 1.5% |
| SP000008280 | 5807 | 1.4% |
| SP000003195 | 5283 | 1.3% |
| SPE00120629 | 5263 | 1.3% |
| SP000060010 | 5246 | 1.3% |
| SPE00155259 | 5244 | 1.3% |
| SP000008027 | 4929 | 1.2% |
| SP000007038 | 4740 | 1.2% |
| SPE00119711 | 4734 | 1.2% |
| SPE00120458 | 4729 | 1.2% |
| Other values (197) | 356648 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| sp000009981 | 6040 | 1.5% |
| sp000008280 | 5807 | 1.4% |
| sp000003195 | 5283 | 1.3% |
| spe00120629 | 5263 | 1.3% |
| sp000060010 | 5246 | 1.3% |
| spe00155259 | 5244 | 1.3% |
| sp000008027 | 4929 | 1.2% |
| sp000007038 | 4740 | 1.2% |
| spe00119711 | 4734 | 1.2% |
| spe00120458 | 4729 | 1.2% |
| Other values (197) | 356648 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1375668 | |
| 1 | 568756 | |
| S | 408663 | 9.1% |
| P | 408663 | 9.1% |
| 2 | 343363 | 7.6% |
| E | 328009 | 7.3% |
| 5 | 204220 | 4.5% |
| 9 | 203451 | 4.5% |
| 6 | 148929 | 3.3% |
| 8 | 143944 | 3.2% |
| Other values (5) | 361627 | 8.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3338031 | |
| Uppercase Letter | 1157262 | 25.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1375668 | |
| 1 | 568756 | |
| 2 | 343363 | 10.3% |
| 5 | 204220 | 6.1% |
| 9 | 203451 | 6.1% |
| 6 | 148929 | 4.5% |
| 8 | 143944 | 4.3% |
| 3 | 135021 | 4.0% |
| 4 | 127379 | 3.8% |
| 7 | 87300 | 2.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 408663 | |
| P | 408663 | |
| E | 328009 | |
| W | 7522 | 0.6% |
| M | 4405 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3338031 | |
| Latin | 1157262 | 25.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1375668 | |
| 1 | 568756 | |
| 2 | 343363 | 10.3% |
| 5 | 204220 | 6.1% |
| 9 | 203451 | 6.1% |
| 6 | 148929 | 4.5% |
| 8 | 143944 | 4.3% |
| 3 | 135021 | 4.0% |
| 4 | 127379 | 3.8% |
| 7 | 87300 | 2.6% |
Latin
| Value | Count | Frequency (%) |
| S | 408663 | |
| P | 408663 | |
| E | 328009 | |
| W | 7522 | 0.6% |
| M | 4405 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4495293 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1375668 | |
| 1 | 568756 | |
| S | 408663 | 9.1% |
| P | 408663 | 9.1% |
| 2 | 343363 | 7.6% |
| E | 328009 | 7.3% |
| 5 | 204220 | 4.5% |
| 9 | 203451 | 4.5% |
| 6 | 148929 | 3.3% |
| 8 | 143944 | 3.2% |
| Other values (5) | 361627 | 8.0% |
fecha
Date
| Distinct | 6368 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
| Minimum | 1896-11-01 00:00:00 |
|---|---|
| Maximum | 2021-08-15 00:00:00 |
Histogram with fixed size bins (bins=50)
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.6141662 |
| Minimum | 1 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 14 |
| median | 27 |
| Q3 | 40 |
| 95-th percentile | 50 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 15.05100727 |
|---|---|
| Coefficient of variation (CV) | 0.5655261624 |
| Kurtosis | -1.196761577 |
| Mean | 26.6141662 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -7.462369122 Ć 10-5 |
| Sum | 10876225 |
| Variance | 226.53282 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 22 | 7939 | 1.9% |
| 18 | 7919 | 1.9% |
| 14 | 7917 | 1.9% |
| 19 | 7916 | 1.9% |
| 9 | 7904 | 1.9% |
| 10 | 7899 | 1.9% |
| 23 | 7876 | 1.9% |
| 27 | 7873 | 1.9% |
| 30 | 7873 | 1.9% |
| 32 | 7867 | 1.9% |
| Other values (43) | 329680 |
| Value | Count | Frequency (%) |
| 1 | 7867 | |
| 2 | 7779 | |
| 3 | 7792 | |
| 4 | 7793 | |
| 5 | 7857 | |
| 6 | 7708 | |
| 7 | 7623 | |
| 8 | 7632 | |
| 9 | 7904 | |
| 10 | 7899 |
| Value | Count | Frequency (%) |
| 53 | 1505 | 0.4% |
| 52 | 7841 | |
| 51 | 7829 | |
| 50 | 7826 | |
| 49 | 7814 | |
| 48 | 7837 | |
| 47 | 7826 | |
| 46 | 7812 | |
| 45 | 7824 | |
| 44 | 7832 |
| Distinct | 523 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 200.4411801 |
| Minimum | -109 |
|---|---|
| Maximum | 444 |
| Zeros | 54 |
| Zeros (%) | < 0.1% |
| Negative | 1403 |
| Negative (%) | 0.3% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | -109 |
|---|---|
| 5-th percentile | 81 |
| Q1 | 147 |
| median | 199 |
| Q3 | 256 |
| 95-th percentile | 322 |
| Maximum | 444 |
| Range | 553 |
| Interquartile range (IQR) | 109 |
Descriptive statistics
| Standard deviation | 74.60750118 |
|---|---|
| Coefficient of variation (CV) | 0.3722164334 |
| Kurtosis | -0.4007092195 |
| Mean | 200.4411801 |
| Median Absolute Deviation (MAD) | 54 |
| Skewness | -0.04288181682 |
| Sum | 81912894 |
| Variance | 5566.279233 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 165 | 2186 | 0.5% |
| 163 | 2176 | 0.5% |
| 167 | 2142 | 0.5% |
| 159 | 2133 | 0.5% |
| 171 | 2128 | 0.5% |
| 173 | 2116 | 0.5% |
| 177 | 2104 | 0.5% |
| 157 | 2089 | 0.5% |
| 183 | 2087 | 0.5% |
| 181 | 2079 | 0.5% |
| Other values (513) | 387423 |
| Value | Count | Frequency (%) |
| -109 | 1 | |
| -104 | 1 | |
| -102 | 1 | |
| -101 | 1 | |
| -99 | 2 | |
| -97 | 2 | |
| -95 | 1 | |
| -93 | 2 | |
| -92 | 1 | |
| -90 | 1 |
| Value | Count | Frequency (%) |
| 444 | 1 | |
| 442 | 1 | |
| 441 | 1 | |
| 432 | 1 | |
| 431 | 1 | |
| 430 | 1 | |
| 428 | 1 | |
| 427 | 2 | |
| 426 | 1 | |
| 425 | 2 |
| Distinct | 438 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 99.13079971 |
| Minimum | -189 |
|---|---|
| Maximum | 275 |
| Zeros | 887 |
| Zeros (%) | 0.2% |
| Negative | 26537 |
| Negative (%) | 6.5% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | -189 |
|---|---|
| 5-th percentile | -8 |
| Q1 | 52 |
| median | 100 |
| Q3 | 149 |
| 95-th percentile | 202 |
| Maximum | 275 |
| Range | 464 |
| Interquartile range (IQR) | 97 |
Descriptive statistics
| Standard deviation | 64.70299443 |
|---|---|
| Coefficient of variation (CV) | 0.6527032428 |
| Kurtosis | -0.5462835166 |
| Mean | 99.13079971 |
| Median Absolute Deviation (MAD) | 49 |
| Skewness | -0.1461837495 |
| Sum | 40511090 |
| Variance | 4186.477488 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 85 | 2402 | 0.6% |
| 101 | 2371 | 0.6% |
| 97 | 2358 | 0.6% |
| 87 | 2347 | 0.6% |
| 79 | 2346 | 0.6% |
| 75 | 2340 | 0.6% |
| 83 | 2337 | 0.6% |
| 91 | 2331 | 0.6% |
| 93 | 2328 | 0.6% |
| 89 | 2320 | 0.6% |
| Other values (428) | 385183 |
| Value | Count | Frequency (%) |
| -189 | 1 | |
| -181 | 1 | |
| -175 | 1 | |
| -174 | 1 | |
| -172 | 1 | |
| -171 | 1 | |
| -170 | 2 | |
| -169 | 1 | |
| -168 | 1 | |
| -166 | 1 |
| Value | Count | Frequency (%) |
| 275 | 1 | < 0.1% |
| 272 | 1 | < 0.1% |
| 271 | 1 | < 0.1% |
| 270 | 1 | < 0.1% |
| 269 | 2 | < 0.1% |
| 267 | 2 | < 0.1% |
| 266 | 2 | < 0.1% |
| 265 | 1 | < 0.1% |
| 264 | 1 | < 0.1% |
| 263 | 5 |
| Distinct | 457 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.27873823 |
| Minimum | 0 |
|---|---|
| Maximum | 1077 |
| Zeros | 163844 |
| Zeros (%) | 40.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 3 |
| Q3 | 19 |
| 95-th percentile | 75 |
| Maximum | 1077 |
| Range | 1077 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 31.51973907 |
|---|---|
| Coefficient of variation (CV) | 1.936251976 |
| Kurtosis | 35.76920565 |
| Mean | 16.27873823 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 4.346156888 |
| Sum | 6652518 |
| Variance | 993.4939508 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 163844 | |
| 1 | 23184 | 5.7% |
| 2 | 14227 | 3.5% |
| 3 | 12296 | 3.0% |
| 4 | 10169 | 2.5% |
| 5 | 8984 | 2.2% |
| 6 | 8152 | 2.0% |
| 7 | 7500 | 1.8% |
| 8 | 6457 | 1.6% |
| 9 | 6283 | 1.5% |
| Other values (447) | 147567 |
| Value | Count | Frequency (%) |
| 0 | 163844 | |
| 1 | 23184 | 5.7% |
| 2 | 14227 | 3.5% |
| 3 | 12296 | 3.0% |
| 4 | 10169 | 2.5% |
| 5 | 8984 | 2.2% |
| 6 | 8152 | 2.0% |
| 7 | 7500 | 1.8% |
| 8 | 6457 | 1.6% |
| 9 | 6283 | 1.5% |
| Value | Count | Frequency (%) |
| 1077 | 1 | |
| 849 | 1 | |
| 811 | 1 | |
| 800 | 1 | |
| 690 | 1 | |
| 689 | 1 | |
| 671 | 1 | |
| 643 | 1 | |
| 641 | 1 | |
| 615 | 1 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0002814054612 |
| Minimum | 0 |
|---|---|
| Maximum | 17 |
| Zeros | 408645 |
| Zeros (%) | > 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.05076044069 |
|---|---|
| Coefficient of variation (CV) | 180.3818606 |
| Kurtosis | 59458.75037 |
| Mean | 0.0002814054612 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 227.6590307 |
| Sum | 115 |
| Variance | 0.002576622339 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) |
| 0 | 408645 | |
| 7 | 4 | < 0.1% |
| 3 | 3 | < 0.1% |
| 2 | 3 | < 0.1% |
| 4 | 3 | < 0.1% |
| 8 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 408645 | |
| 2 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
| 4 | 3 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 17 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 4 | < 0.1% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
| 2 | 3 | < 0.1% |
| 0 | 408645 |
| Distinct | 406 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4699520143 |
| Minimum | 0 |
|---|---|
| Maximum | 2385 |
| Zeros | 406496 |
| Zeros (%) | 99.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2385 |
| Range | 2385 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 16.33949568 |
|---|---|
| Coefficient of variation (CV) | 34.76843419 |
| Kurtosis | 5632.042288 |
| Mean | 0.4699520143 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 64.20443149 |
| Sum | 192052 |
| Variance | 266.9791191 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 406496 | |
| 1 | 502 | 0.1% |
| 3 | 248 | 0.1% |
| 4 | 183 | < 0.1% |
| 6 | 96 | < 0.1% |
| 7 | 70 | < 0.1% |
| 9 | 42 | < 0.1% |
| 10 | 34 | < 0.1% |
| 13 | 23 | < 0.1% |
| 16 | 22 | < 0.1% |
| Other values (396) | 947 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 406496 | |
| 1 | 502 | 0.1% |
| 2 | 6 | < 0.1% |
| 3 | 248 | 0.1% |
| 4 | 183 | < 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 96 | < 0.1% |
| 7 | 70 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 42 | < 0.1% |
| Value | Count | Frequency (%) |
| 2385 | 1 | |
| 2098 | 1 | |
| 1901 | 1 | |
| 1743 | 1 | |
| 1714 | 2 | |
| 1706 | 1 | |
| 1700 | 1 | |
| 1593 | 1 | |
| 1472 | 1 | |
| 1414 | 1 |
| Distinct | 201 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.66839045 |
| Minimum | 27.8189 |
|---|---|
| Maximum | 43.5667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 27.8189 |
|---|---|
| 5-th percentile | 28.4775 |
| Q1 | 38.282 |
| median | 40.8206 |
| Q3 | 42.0831 |
| 95-th percentile | 43.3669 |
| Maximum | 43.5667 |
| Range | 15.7478 |
| Interquartile range (IQR) | 3.8011 |
Descriptive statistics
| Standard deviation | 3.765967051 |
|---|---|
| Coefficient of variation (CV) | 0.09493622019 |
| Kurtosis | 3.142702015 |
| Mean | 39.66839045 |
| Median Absolute Deviation (MAD) | 1.6313 |
| Skewness | -1.846323165 |
| Sum | 16211003.45 |
| Variance | 14.18250783 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 40.8206 | 6040 | 1.5% |
| 38.9519 | 5807 | 1.4% |
| 40.4117 | 5283 | 1.3% |
| 41.1144 | 5263 | 1.3% |
| 28.3089 | 5246 | 1.3% |
| 41.4181 | 5244 | 1.3% |
| 38.9892 | 5182 | 1.3% |
| 40.9478 | 4992 | 1.2% |
| 43.3075 | 4929 | 1.2% |
| 37.9769 | 4740 | 1.2% |
| Other values (191) | 355937 |
| Value | Count | Frequency (%) |
| 27.8189 | 2486 | |
| 27.9225 | 2561 | |
| 28.0475 | 2138 | |
| 28.3089 | 5246 | |
| 28.4444 | 2850 | |
| 28.4631 | 4729 | |
| 28.4775 | 4132 | |
| 28.6331 | 2807 | |
| 28.9517 | 2718 | |
| 35.2778 | 3097 |
| Value | Count | Frequency (%) |
| 43.5667 | 2750 | |
| 43.5606 | 2231 | |
| 43.5381 | 3259 | |
| 43.4917 | 2663 | |
| 43.4644 | 3813 | |
| 43.4292 | 3074 | |
| 43.3669 | 4734 | |
| 43.3606 | 3279 | |
| 43.3542 | 2542 | |
| 43.3075 | 4929 |
| Distinct | 206 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -3.446611876 |
| Minimum | -17.8889 |
|---|---|
| Maximum | 4.2156 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 286623 |
| Negative (%) | 70.1% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | -17.8889 |
|---|---|
| 5-th percentile | -16.2553 |
| Q1 | -5.6492 |
| median | -3.4503 |
| Q3 | 0.4914 |
| 95-th percentile | 2.3767 |
| Maximum | 4.2156 |
| Range | 22.1045 |
| Interquartile range (IQR) | 6.1406 |
Descriptive statistics
| Standard deviation | 4.697730941 |
|---|---|
| Coefficient of variation (CV) | -1.362999697 |
| Kurtosis | 1.513917608 |
| Mean | -3.446611876 |
| Median Absolute Deviation (MAD) | 2.6053 |
| Skewness | -1.171040453 |
| Sum | -1408502.749 |
| Variance | 22.06867599 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -3.7892 | 6539 | 1.6% |
| 0.4914 | 6040 | 1.5% |
| -1.8631 | 5807 | 1.4% |
| -3.6781 | 5283 | 1.3% |
| -1.4106 | 5263 | 1.3% |
| -16.4992 | 5246 | 1.3% |
| 2.1239 | 5244 | 1.3% |
| -2.0392 | 4929 | 1.2% |
| 0.7106 | 4740 | 1.2% |
| -8.4192 | 4734 | 1.2% |
| Other values (196) | 354838 |
| Value | Count | Frequency (%) |
| -17.8889 | 2486 | |
| -17.755 | 2807 | |
| -16.5606 | 2138 | |
| -16.4992 | 5246 | |
| -16.3292 | 4132 | |
| -16.2553 | 4729 | |
| -15.3892 | 2561 | |
| -13.8631 | 2850 | |
| -13.6003 | 2718 | |
| -8.6494 | 1393 | 0.3% |
| Value | Count | Frequency (%) |
| 4.2156 | 2828 | |
| 3.1817 | 663 | 0.2% |
| 3.1658 | 663 | 0.2% |
| 3.0967 | 663 | 0.2% |
| 3.0353 | 663 | 0.2% |
| 3.0325 | 663 | 0.2% |
| 2.8342 | 246 | 0.1% |
| 2.8267 | 573 | 0.1% |
| 2.8253 | 3023 | |
| 2.8067 | 522 | 0.1% |
| Distinct | 173 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 419.6312517 |
| Minimum | 1 |
|---|---|
| Maximum | 2535 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 42 |
| median | 247 |
| Q3 | 656 |
| 95-th percentile | 1143 |
| Maximum | 2535 |
| Range | 2534 |
| Interquartile range (IQR) | 614 |
Descriptive statistics
| Standard deviation | 504.4009139 |
|---|---|
| Coefficient of variation (CV) | 1.202009888 |
| Kurtosis | 4.626454871 |
| Mean | 419.6312517 |
| Median Absolute Deviation (MAD) | 233 |
| Skewness | 1.96114448 |
| Sum | 171487766.2 |
| Variance | 254420.282 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 4 | 12103 | 3.0% |
| 1 | 8690 | 2.1% |
| 35 | 7252 | 1.8% |
| 32 | 6994 | 1.7% |
| 44 | 6040 | 1.5% |
| 64 | 5951 | 1.5% |
| 5 | 5911 | 1.4% |
| 704 | 5807 | 1.4% |
| 87 | 5683 | 1.4% |
| 7 | 5650 | 1.4% |
| Other values (163) | 338582 |
| Value | Count | Frequency (%) |
| 1 | 8690 | |
| 2 | 1326 | 0.3% |
| 3 | 3200 | 0.8% |
| 4 | 12103 | |
| 5 | 5911 | |
| 6 | 2793 | 0.7% |
| 7 | 5650 | |
| 8 | 495 | 0.1% |
| 11 | 4482 | 1.1% |
| 14 | 3381 | 0.8% |
| Value | Count | Frequency (%) |
| 2535 | 660 | 0.2% |
| 2519 | 659 | 0.2% |
| 2451 | 679 | 0.2% |
| 2400 | 653 | 0.2% |
| 2371 | 5246 | |
| 2316 | 663 | 0.2% |
| 2266 | 663 | 0.2% |
| 2247 | 647 | 0.2% |
| 2230 | 663 | 0.2% |
| 2228 | 662 | 0.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's Ļ
The Spearman's rank correlation coefficient (Ļ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate Ļ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's Ļ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (Ļ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate Ļ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. Ļ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (Ļk)
Phik (Ļk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| id_estacion | fecha | fecha_cnt | tmax | tmin | precip | nevada | prof_nieve | longitud | latitud | altitud | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | SP000003195 | 1920-01-04 | 1 | 96.0 | 41.0 | 4.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 1 | SP000003195 | 1920-01-11 | 2 | 81.0 | 5.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 2 | SP000003195 | 1920-01-18 | 3 | 117.0 | 21.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 3 | SP000003195 | 1920-01-25 | 4 | 118.0 | 19.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 4 | SP000003195 | 1920-02-01 | 5 | 106.0 | 31.0 | 2.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 5 | SP000003195 | 1920-02-08 | 6 | 113.0 | 8.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 6 | SP000003195 | 1920-02-15 | 7 | 119.0 | 6.0 | 0.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 7 | SP000003195 | 1920-02-22 | 8 | 114.0 | 62.0 | 99.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 8 | SP000003195 | 1920-02-29 | 9 | 125.0 | 74.0 | 29.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
| 9 | SP000003195 | 1920-03-07 | 10 | 137.0 | 64.0 | 22.0 | 0.0 | 0.0 | 40.4117 | -3.6781 | 667.0 |
Last rows
| id_estacion | fecha | fecha_cnt | tmax | tmin | precip | nevada | prof_nieve | longitud | latitud | altitud | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 408653 | SPW00014011 | 1967-10-29 | 43 | 181.0 | 83.0 | 11.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 408654 | SPW00014011 | 1967-11-05 | 44 | 133.0 | 23.0 | 16.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 408655 | SPW00014011 | 1967-11-12 | 45 | 141.0 | 26.0 | 20.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 408656 | SPW00014011 | 1967-11-19 | 46 | 130.0 | 57.0 | 68.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 408657 | SPW00014011 | 1967-11-26 | 47 | 135.0 | 65.0 | 18.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 408658 | SPW00014011 | 1967-12-03 | 48 | 126.0 | 22.0 | 1.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 408659 | SPW00014011 | 1967-12-10 | 49 | 87.0 | -13.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 408660 | SPW00014011 | 1967-12-17 | 50 | 64.0 | -60.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 408661 | SPW00014011 | 1967-12-24 | 51 | 64.0 | -16.0 | 2.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |
| 408662 | SPW00014011 | 1967-12-31 | 52 | 82.0 | -19.0 | 0.0 | 0.0 | 0.0 | 40.4833 | -3.45 | 608.1 |